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Abstract 

In recent work, Jon Kleinberg considered a small-world network model consisting of 
a d-dimensional lattice augmented with shortcuts. The probability of a shortcut being 
present between two points decays as a power, r~ Q , of the distance r between them. Klein- 
berg showed that greedy routing is efficient if a = d and that there is no efficient decen- 
tralised routing algorithm if a 7^ d. The results were extended to a continuum model by 
Franceschetti and Meester. In our work, we extend the result to more realistic models con- 
structed from a Poisson point process, wherein each point is connected to all its neighbours 
within some fixed radius, as well as possessing random shortcuts to more distant nodes as 
described above. 



1 Introduction 

A classical random graph model introduced by Erdos and Renyi consists of n nodes, with the 
edge between any pair of vertices being present with probability p(n), independent of other 
pairs. Recently, there has been considerable interest in alternative models where the nodes 
are given coordinates in an Euclidean space, and the probability of an edge between a pair 
of nodes u and v is given by a function g(-) of the distance r(u,v) between the nodes; edges 
between different node pairs are independent. Such 'random connection' or 'spatial random 
graph' models and variants thereof arise, for instance, in the study of wireless communication 
networks. 

The "small-world phenomenon" (the principle that all people are linked by short chains of 
acquaintances), which has long been a matter of folklore, was inaugurated as an area of experi- 
mental study in the social sciences through the pioneering work of Stanley Milgram [Jj. Recent 
works have suggested that the phenomenon is pervasive in networks arising in nature and tech- 
nology, and motivated interest in mathematical models of such networks. While Erdos-Renyi 
random graphs possess the property of having a small diameter (smaller than logarithmic in 
the number of nodes, above the connectivity threshold for p(n)), they are not good models for 
social networks because of the independence assumption. On the other hand, spatial random 
graphs are better at capturing clustering because of the implicit dependence between edges 
induced by the connection function g(-). 

Watts and Strogatz conducted a set of re- wiring experiments on graphs, and observed 
that by re-wiring a few random links in finite lattices, the average path length was reduced 
drastically (approaching that of random graphs). This led them to propose a model of "small- 
world graphs" which essentially consists of a lattice augmented with random links acting as 



* Statistical Laboratory, Centre for Mathematical Sciences, Wilberforce Road, Cambridge CB3 OWB UK 
E-mail: M. Draief @statslab . cam. ac .uk 

^Microsoft Research, 7 J.J. Thomson Avenue, Cambridge CB3 OFB E-mail: ajg@micorsoft.com 



1 



shortcuts, which play an important role in shrinking the average path link. By the length of 
a path we mean the number of edges on it, and distance refers to graph distance (length of 
shortest path) unless otherwise specified. 

The diameter of the Watts-Strogatz model in the 1-dimensional case was obtained by Bar- 
bour and Reinert [Q. Benjamini and Berger [2] considered a variant of this 1-dimensional 
model wherein the shortcut between any pair of nodes, instead of being present with constant 
probability, is present with probability given by a connection function g(-); they specifically 
considered connection functions of the form g(r) ~ (5r~ a , where (5 and a are given constants, 
and r(u, v) is the graph distance between u and v in the underlying lattice (i.e., the L\ distance). 

The general d-dimensional version of this model, on the finite lattice with n d points, was studied 
by Coppersmith et al. ;4j . They showed that the diameter of the graph is (i) ©(log nj log log n) 
if a = d, (ii) at most polylogarithmic in n if d < a < 2d, and (iii) at least polynomial in n if 
a > 2d. Finally, it was shown by Benjamini et al. [3] that the diameter is a constant if a < d. 

The sociological experiments of Milgram demonstrated not only that there is a short chain of 
acquaintances between strangers but also that they are able to find such chains. What sort 
of graph models have this property? Specifically, when can decentralised routing algorithms 
(which we define later) find a short path between arbitrary source and destination nodes? 

This question was addressed by Jon Kleinberg [5] for the class of finite d-dimensional lattices 
augmented with shortcuts, where the probability of a shortcut being present between two nodes 
decays as a power, r~ a of the distance r between them. Kleinberg showed that greedy routing 
is efficient if a = d and that there is no efficient decentralised routing algorithm if a ^ d. The 
results were extended to a continuum model by Franceschetti and Meester [3] . Note that these 
results show that decentralised algorithms cannot find short routes when a ^ d, even though 
such routes are present for a < 2d by the results of Benjamini et al. and Coppersmith et al. 
cited above; when a > 2d, no short routes are present. 

2 Our Model 

In this work, we consider a model constructed from a Poisson point process on a finite square, 
wherein each point is connected to all its neighbours within some fixed radius, as well as 
possessing random shortcuts to more distant nodes. More precisely: 

• We consider a sequence of graphs indexed by n 6 N. 

• Nodes form a Poisson process of rate 1 on the square [0, \/n\ 2 - 

• Each node x is linked to all nodes that are distance less that r n = \J c log n for a sufficiently 
large constant, c. In particular, if c > 1/tt, then this graph is connected with high 
probability (abbreviated whp, and meaning with probability going to 1 as n tends to 
infinity); see [SJ. These links are referred to as local edges and the corresponding nodes 
as the local contacts of x. 

• For two nodes u and v such that r(u,v) > y / clogn, the edge (u, v) is present with 
probability a n r(u,v)~ a A 1. Such edges are referred to as shortcuts. The parameter a n 
is chosen so that the expected number of shortcuts per node is equal to some specified 
constant, d. 
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The objective is to route a message from an arbitrary source node s to an arbitrary destination 
t using a small number of hops. We are interested in decentralised routing algorithms, which 
do not require global knowledge of the graph topology. It is assumed throughout that each 
node knows its location (co-ordinates) on the plane, as well as the location of all its neighbours, 
both local and shortcut, and of the destination t. We show that efficient decentralised routing 
is possible only if a = 2. More precisely, we show the following: 

• a = 2: there is a greedy decentralised algorithm to route a message from source to 
destination in 0(log 2 n) hops. 

• a < 2: any decentralised routing needs more than n 1 hops on average, for any 7 such 
that 7 < (2 - a)/6. 

• a > 2: any decentralised routing needs more than n 7 hops on average, for any 7 < 2 (a-\) • 

As noted by Kleinberg for the lattice model, the case a = 2 corresponds to a "scale-free" 
network: the expected number of shortcuts from a node x to nodes which lie between distance 
r and 2r from it is the same for any r. It was observed by Franceschetti and Meester in their 
continuum model that this property is related to the impossibility of efficient decentralised 
routing when a / 2 through the fact that shortcuts can't make sufficient progress towards 
the destination when a > 2 (they are too short) while they can't home in on small enough 
neighbourhoods of the destination when a < 2 (they are too long). Similar remarks apply to 
our model as well. 

A model very similar to ours was considered by Sharma and Mazumdar 9 who use it to describe 
an ad-hoc sensor network. The sensors are located at the points of a Poisson process and can 
communicate with nearby sensors through wireless links (corresponding to local contacts). In 
addition, it is possible to deploy a small number of wired links (corresponding to shortcuts), 
and the question they address is that of how to place these wired links in order to enable 
efficient decentralised routing. 

In the analysis presented below, we ignore edge effects for ease of exposition. This is equivalent 
to considering distances as being defined on the torus obtained by identifying opposite edges 
of the square. 

3 Efficiency of greedy routing when a = 2 

When a = 2, we show that the following approximately greedy algorithm succeeds whp in 
reaching the destination in a number of hops which is polylogarithmic in n, the expected 
number of nodes. 

Denote by C(u,r) the circle of radius r centred at node u. If there is no direct link from the 
source s to the destination t, then the message is passed via intermediate nodes as follows. 
At each stage, the message carries the address (co-ordinates) of the destination t, as well as a 
radius r which is initialised to r(s,t), the distance between s and t. Suppose the message is 
currently at node x and has radius r > \/c log n. (If r < \Jc log n, then the node which updated 
r would have contained t in its local contact list and delivered the message immediately.) 
If node x has a shortcut to some node y £ A(t,r), where the annulus A(t,r) is defined as 
A(t,r) = C(t, |) \ C(t, 5), then x forwards the message to y. If there is more than one such 
node, the choice can be arbitrary. Otherwise, it forwards the message to one of its local 
contacts which is closer to t than itself. When a node y receives a message, it updates r to r/2 
if r(y,t) < r/2, and leaves r unchanged otherwise. 
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In other words, if x can find a shortcut which reduces the distance to the destination by at least 
a half but by no more than three-quarters, it uses such a shortcut. Otherwise, it uses a local 
contact to reduce the distance to the destination. In that sense, the algorithm is approximately 
greedy. The reason for considering such an algorithm rather than a greedy algorithm that would 
minimize the distance to the destination at each step is to preserve independence, which greatly 
simplifies the analysis. Note that if a greedy step from x takes us to y (i.e., of all nodes to 
which x possesses a shortcut, y is closest to t), then the conditional law of the point process in 
the circle C(t, r(t, y)) is no longer unit rate Poisson. The fact that there are no shortcuts from 
x to nodes within this circle biases the probability law and greatly complicates the analysis. 
Our approximate greedy algorithm gets around this problem. 

Observe that if the message passes through a node x, the value of r immediately after visiting 
x lies between r(x,t) and 2r(x,t). 

We have implicitly assumed that any node can find a local contact closer to t than itself. We 
first show that this assumption holds whp if c is chosen sufficiently large. 

Fix c > and n G N. For two points x and y in the square [0, y/n\ 2 , and a realisation uj of the 
unit rate Poisson process on the square, define the properties 

CP n (x, y, uj) = {3 u £ uj : r(u, y) < r(x, y) and r(u, x) < \J ' c logn}, 

and 

? n {u)= /\ 9 n {x,y,u). 

(x,y):r(x,y)>^c\ogn 

Lemma 3.1. If c > is sufficiently large, then P((P n (-)) — > 1 as n tends to infinity. 

In words, with high probability, any two points x and y in the square [0, ^/n] 2 with r(x,y) > 
\] c log n have the property that there is a point u of the unit rate Poisson process within 
distance yj c log n of x which is closer than x to y. In particular, if x and y are themselves 
points of the Poisson process, then u is a local contact of x which is closer to y. The key point 
to note about the lemma is that it gives a probability bound which is uniform over all such 
node pairs. 

Proof. Suppose r(x,t) > \/clogn. Consider the circle C\ of radius \J c log n centred at x and the 
circle C2 of radius r(x, t) centred at t. For any point y / x in their intersection, r(y, t) < r(x, t). 
Moreover, the intersection contains a sector of C\ of angle 2tt/3. Denote this sector D\. Now 
consider a tessellation of the square [0, \/n\ 2 by small squares of side (3\/c log n. Note that for a 
sufficiently small geometrical constant (3 that doesn't depend on c or n {(3 = 1/2 suffices), the 
sector D\ fully contains at least one of the smaller squares. Hence, if every small square contains 
at least one point of the Poisson process, then every node at distance greater than sj c log n 
from t can find at least one local contact which is closer to t. Number the small squares in some 
order and let X{ denote the number of nodes in the i th small square, i = 1, . . . , n/(f3 2 c\og n). 
The number of squares is assumed to be an integer for simplicity. Clearly, the Xi are iid Poisson 
random variables with mean [3 2 c\ogn. Hence, by the union bound, 

n/(/3 2 c log n) 

P(3i:X l = 0)< V P(X t = 0) = e'^ n , 

^ p z c logn 

which goes to zero as n tends to infinity, provided that (3 2 c > 1. In particular, c > 4 suffices 
since we can take (3 = 1/2. □ 
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We now state the main result of this section. 



Theorem 3.2. Consider the small world random graph described above with a = 2, expected 
node degree d = 1, and c > sufficiently large, as required by Lem,ma, \3.1\ Then, the number 
of hops for message delivery between any pair of nodes is of order log 2 n whp. 

Proof. We first evaluate the normalisation constant a n by noting that the expected degree, d, 
of a node located at the centre of the square satisfies 



d < a n x 2-Kxdx = 7ra n (logn — loglogn — log(2c)), 

J \/c log n 



and so 

1 

a n > -, , (1) 

logn 

for all n sufficiently large, by the assumption that d = 1. 

Next, we compute the probability of finding a suitable shortcut at each step of the greedy 
routing algorithm. We think of the routing algorithm as proceeding in phases. The value of 
r is halved at the end of each phase. The value of r immediately after the message reaches a 
node x satisfies the relation r(x, t) S (r/2, r] at each step of the routing algorithm. We suppose 
that r > k^/cTogn, for some large constant k. 

Denote by Na the number of nodes in the annulus A(t, r) and observe that Na is Poisson with 
mean 37rr 2 /16. The distance from x to any of these nodes is bounded above by 3r/2, and so 
the probability that a shortcut from x is incident on a particular one of these nodes is bounded 
below by a n (3r/2)~ 2 . Thus, conditional on Na, the probability that x has a shortcut to one 
of the Na nodes in A(t, r) is bounded below by 

P(r,N A ) = l-(l-^) NA . (2) 

If x doesn't have such a shortcut, the message is passed via local contacts which are successively 
closer to t, and hence satisfy the same lower bound on the probability of a shortcut to A(t,r). 
Consequently, the number of local steps L x until a shortcut is found is bounded above by a 
geometric random variable with conditional mean l/p(r, Na)- Since N A ~ Pois(3vrr 2 /16), we 
have by a standard application of the Chernoff bound that 



P(N A < V/16) < exp(-^^ + ^log^i 



for any 7 < 3tt. 

Suppose first that r > kyj c log n for some large constant k. Taking 7 = 3n/2, we obtain 

_/, T 37rr 2 \ / 37r/c 2 clogn /1 . „ N \ 

P ( NA * -32") * 6XP ( 3Y^ (1 " l0g2) )- (3) 

Suppose first that Na < 37rr 2 /32. The number of local hops, L x , to route the message from 
x to A is bounded above by the number of nodes outside A, since the distance to t is strictly 
decreasing after each hop. Hence, 

< n — area(^4) < n. (4) 







3vrr 2 - 




E 


L x 


^< 32 . 


< 
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Next, if N A > 37rr 2 /32, then we have by © and © that 



p(r, Na) > 1 — exp 



~24~ 



> 1 — exp 



7T 



> 



7T 



24 log n / 48 log n ' 



where the last inequality holds for all n sufficiently large. Since the number of hops to reach A 
is bounded above by a geometric random variable with mean l/p(r, Na), we have 







3vrr 2 - 




E 


L x 


N A > 

32 . 


< 



48 



7T 



(5) 



Finally, we obtain from (J3J), Q and JSJ) that 



^ rr n / 3nk 2 c(l - log 2) , \ 48, 
E[L X \ < n exp — log nj + — log n. 

The first term in the sum above can be made arbitrarily small by choosing k large enough, 
so -Ef-Li] = O(logn). It can also be seen from the arguments above that L x = O(logn) whp. 
In other words, while r > k\Jc log n, the number of hops during each phase is of order logn. 
Moreover, the number of such phases is of order logn since the initial value of r is at most 
V2n, and r halves at the end of each phase. 

Hence, the total number of hops until r < k\/ c log n is of order log 2 n. Once the message 
reaches a node x with r(x, t) < k^/clogn, the number of additional hops to reach t is bounded 
above by the total number of nodes in the circle C(t, k\J clogra). By using the Chernoff bound 
for a Poisson random variable, it can be shown that this number is of order logn whp. This 
completes the proof of the theorem. □ 



4 Impossibility of efficient routing when a ^ 2 

We now show that if a < 2, then no decentralised algorithm can route between arbitrary 
source-destination pairs in time which is polylogarithmic in n. In fact, the number of routing 
hops is polynomial in n with some fractional power that depends on a. 

We now make precise what we mean by a decentralised routing algorithm. As specified earlier, 
each node knows the locations of all its local contacts with distance \J c log n and of all its 
shortcut neighbours, as well as other nodes (if any) from which shortcuts are incident to it. 
A routing algorithm specifies a (possibly random) sequence of nodes s = xo, x\, . . . , Xk = 
t,Xk+i = t, . . ., where the only requirement is that each node Xi be chosen from among the 
local or shortcut contacts of nodes {xo, ■ ■ ■ , arj-i}- (This is the same definition as used by 
Kleinberg [H]). 

Theorem 4.1. Consider the small world random graph described above with a < 2, and arbi- 
trarily large constants c and d. Suppose the source s and destination t are chosen uniformly at 
random from the node set. Then, the number of hops for message delivery in any decentralised 
algorithm exceeds n 7 whp, for any 7 < (2 — a)/6. 

It is not important that the source and destination be chosen uniformly but only that the 
distance between them be of order n a whp for some a > 0. 

Proof. We first evaluate the normalisation constant a n by noting that the expected degree 
satisfies _ 

rVn/2 1 /r) (2-a)/2 N 

d>a n x- a 2nxdx = _ (clog n) (2-«)/2^ 



Vc log n - ' 1 v 2 2 Q 
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which, on simplification, yields that 



4d 

a n < ^=55^. (6) 

for all n sufficiently large. Note that a n is an upper bound on the probability that there is a 
shortcut between any pair of nodes. 

Suppose that the source s and destination t are chosen uniformly from all nodes on [0, y/n] 2 . 
Fix 5 G (7, 1/2) and define Cg = C(t,n s ) to be the circle of radius n s centred at t. It is clear 
that, for any e > 0, the distance r(s, Cg) from s to the circle Cg is bigger than nA 1 / 2 ) -6 whp. 
Suppose now that this inequality holds, but that there is a routing algorithm which can route 
from s to t in fewer than n 7 hops. Denote by s = xq,x±, . . . ,x m = t, the sequence of nodes 
visited by the routing algorithm, with m < ri 1 . We claim that there must be a shortcut from 
at least one of the nodes xq,x\, . . . ,x m -\ to the set Cg. Indeed, if there is no such shortcut, 
then t must be reached starting from some node outside Cg and using only local links. Since 
the length of each local link is at most yj c log n and the number of hops is at most n 7 , the 
total distance traversed by local hops is strictly smaller than n s (for large enough n, by the 
assumption that S > 7), which yields a contradiction. We now estimate the probability that 
there is a shortcut from one of the nodes xq, . . . , x m -\ to the set Cg. 

The number of nodes in the circle Cg, denoted Nc, is Poisson with mean 7rn 2S , so N c < 4ra 2<5 
whp. Now, by © and the union bound, 

P(3 shortcut between u and Cg\N c < 4n 25 ) < 16 d n ^ 5+a - 2 ^ 2 , 

for any node u. Applying this bound repeatedly for each of the nodes xq, x±, . . . , x m _i generated 
by the routing algorithm, we get, 

P{3 shortcut to Cg within n 7 hops)|A^ c < 4n 25 ) < 16 d n ( 2 7+45+a-2)/2_ ^ 

Now 7 < (2 — a)/6 by assumption, and S > 7 can be chosen arbitrarily. In particular, we can 
choose S so that 27 + 4<5 + a — 2 is strictly negative, in which case the conditional probability 
of a shortcut to Cg goes to zero as n — > 00. Since P(N C > 4n 2<5 ) also goes to zero, we conclude 
that the probability of finding an s — t route with fewer than n 7 hops also goes to zero. This 
concludes the proof of the theorem. □ 

Remarks: The theorem continues to hold if we assume 1-step lookahead. By this, we mean 
that when a node decides where to send the message at the next step, it can not only use the 
locations of all its local and shortcut contacts, but also the locations of their contacts. All this 
means is that after visiting n 7 nodes, the algorithm has knowledge about 0(n 7 log n) nodes. If 
none of these nodes has a shortcut into the set Cg, which is the case whp, then the arguments 
above still apply. The same is true for A;-step lookahead, for any constant k. 

Theorem 4.2. Consider the small world random graph described above with a > 2, and arbi- 
trarily large constants c and d. Suppose the source s and destination t are chosen uniformly at 
random from the node set. Then, the number of hops for message delivery in any decentralised 
algorithm exceeds n 7 whp, for any 7 < (a — 2)/(2(a — 1)). 

Proof. For a node u, the probability that a randomly generated shortcut has length bigger than 
r is bounded above by 

fj*X- a 2TTxdx ^ cong ^ r2 „ a(logn)(a _ 2)/2) 

H, x- a 2Trxdx 

J \/ciogn 
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for all n sufficiently large. Since there are 2d shortcuts per node on average, the probability 
that two nodes u and v separated by distance r or more possess a shortcut between them is 
bounded above by the same function, but with the constant suitably modified. 

Now, for randomly chosen nodes s and t, r(s,t) > rS l / 2 ^ t whp, for any e > 0. Hence, there 
can be a path of length n 1 hops between s and t only if at least one of the hops is a shortcut of 
length n^ 1 / 2 )" 6-7 or more. By the above and the union bound, the probability of there being 
such a shortcut is bounded above by 

const, n 7 (n (1 ^- e -~^ ^ (log n) (a ~ 2)/2 . 
The exponent of n in the above expression is 

^(l-2e)+ 7 (a-l). 

The exponent above is negative for sufficiently small e > provided 7 < (a — 2)/(2(a — 1)). 
In other words, if this inequality is satisfied, then the probability of finding a route with fewer 
than n 1 hops goes to zero as n — ► 00. This establishes the claim of the theorem. □ 
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